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Abstract 



The magnitude of m u , rrid and m s is discussed on the basis of Chiral 
Perturbation Theory. In particular, the claim that m u = leads to a coherent 
picture for the low energy structure of QCD is examined in detail. It is 
pointed out that this picture leads to violent flavour asymmetries in the matrix 
elements of the scalar and pseudoscalar operators, which are in conflict with 
the hypothesis that the light quark masses may be treated as perturbations. 
[Talk delivered at the 2nd IFT Workshop, "Yukawa Couplings and the Origins 
of Mass", Gainesville, Florida, Feb. 1994, to be published by International 
Press. 1 
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1 Effective low energy theory of QCD 



At low energies, the behaviour of scattering amplitudes or current matrix elements 
can be described in terms of a Taylor series expansion in powers of the momenta. 
The electromagnetic form factor of the pion, e.g., may be exanded in powers of the 
momentum transfer t. In this case, the first two Taylor coefficients are related to the 
total charge of the particle and to the mean square radius of the charge distribution, 
respectively, 

^ + (t) = l + i(rV* + 0(t 2 ) . (i) 

Scattering lengths and effective ranges are analogous low energy constants occurring 
in the Taylor series expansion of scattering amplitudes. 

For the straightforward expansion in powers of the momenta to hold it is essential 
that the theory does not contain massless particles. The exchange of photons, e.g., 
gives rise to Coulomb scattering, described by an amplitude of the form e 2 /(p' — 
p) 2 which does not admit a Taylor series expansion. Now, QCD does not contain 
massless particles, but it does contain very light ones: pions. The occurrence of 
light particles gives rise to singularities in the low energy domain which limit the 
range of validity of the Taylor series representation. The form factor f n +(t), e.g., 
contains a cut starting at t = AM 2 , such that the formula ([!]) provides an adequate 
representation only for t <C AM 2 . To extend this representation to larger momenta, 
one needs to account for the singularities generated by the pions. This can be 
done, because the reason why M n is so small is understood: the pions are the 
Goldstone bosons of a hidden, approximate symmetry. The low energy singularities 
generated by the remaining members of the pseudoscalar octet (K^, K°, K°, rj) can 
be dealt with in the same manner, exploiting the fact that the Hamiltonian of 
QCD is approximately invariant under SU(3) R x SU(3) L . If the three light quark 
flavours u, d, s, were massless, this symmetry would be an exact one. In reality, chiral 
symmetry is broken by the quark mass term ocurring in the QCD Hamiltonian 

Hqpb = H + Hi , Hi = Jd 3 x{m u uu + m d dd + m s ss} . 

For yet unknown reasons, the masses m u ,m d ,m s however happen to be small, such 
that Hi can be treated as a perturbation. First order perturbation theory shows 
that the expansion of the square of the pion mass in powers of m u ,m d ,m s starts 
with 

M 2 + = (m u + m d )B{l + 0(m u ,m d ,m s )} , (2) 
while, for the kaon, the leading term contains the mass of the strange quark, 

M 2 K+ = (m u + m s )B + ... M 2 K o = (m d + m s )B + ... (3) 

This explains why the pseudoscalar octet contains the eight lightest hadrons and 
why the mass pattern of this multiplet very strongly breaks eightfold way symmetry: 
M 2 , M\ and M 2 are proportional to combinations of quark masses, which are small 
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but very different from one another, m s 3> > m u . For all other multiplets of 
SU(3), the main contribution to the mass is given by the eigenvalue of H and is of 
order A QCD , while Hi merely generates a correction which splits the multiplet, the 
state with the largest matrix element of ss ending up at the top. 

The effective field theory combines the expansion in powers of momenta with 
the expansion in powers of m u ,ma,m s . The resulting new improved Taylor series, 
which explicitly accounts for the singularities generated by the Goldstone bosons, 
is referred to as chiral perturbation theory (%PT). It provides a solid mathematical 
basis for what used to be called the "PCAC hypothesis" fj], |2|, |3|, §]]. 

It does not appear to be possible to account for the singularities generated by 
the next heavier bound states, the vector mesons, in an equally satisfactory manner. 
The mass of the p-meson is of the order of the scale of QCD and cannot consistently 
be treated as a small quantity. Although the vector meson dominance hypothesis 
does lead to valid estimates (an example is given below), a coherent framework 
which treats these estimates as leading terms of a systematic approximation scheme 
is not in sight. 

The effective low energy theory replaces the quark and gluon fields of QCD by a 
set of pseudoscalar fields describing the degrees of freedom of the Goldstone bosons 
7i,K,r]. It is convenient to collect these fields in a 3 x 3 matrix U(x) GSU(3). 
Accordingly, the Lagrangian of QCD is replaced by an effective Lagrangian, which 
only involves the field U(x) and its derivatives. The most remarkable point here 
is that this procedure does not mutilate the theory: if the effective Lagrangian is 
chosen properly, the effective theory is mathematically equivalent to QCD ||T], f3j. 

On the level of the effective Lagrangian, the combined expansion introduced 
above amounts to an expansion in powers of derivatives and powers of the quark 
mass matrix 

/ rn u 
m = m d 

V m s 

Lorentz invariance and chiral symmetry very strongly constrain the form of the 
terms occurring in this expansion. Counting m like two powers of momenta, the 
expansion starts at 0(p 2 ) and only contains even terms 

£*ff = £eff + £eff + ^Iff + ■ ■ ■ 

The leading contribution is of the form 

C 2 eff = \Fhi{d^U + d»U} + \F 2 Bti{m{U + U + )} (4) 

and involves two independent coupling constants: the pion decay constant F and 
the constant B occurring in the mass formulae (0) and ([3]). The expression ([|) 
represents a compact summary of the soft pion theorems established in the 1960's: 
the leading terms in the chiral expansion of the scattering amplitudes and current 
matrix elements are given by the tree graphs of this Lagrangian. 
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At order p 4 , the effective Lagrangian contains terms with four derivatives such 

as 

C% f = L 1 [ti{d tl U + d»U}} 2 + ... 

as well as terms with one or two powers of m. Altogether, ten coupling constants 
occur H, denoted Li, . . . ,L W . Four of these are needed to specify the scattering 
matrix to first nonleading order. The terms of order m 2 in the meson mass formulae 
(0) and (^|) involve another three of these constants. The remaining three couplings 
concern current matrix elements. 

As an illustration, consider again the e.m. form factor f n +(t). To order p 2 , the 
chiral representation reads |§ 

U+(t) = 1 + j^{2L 9 + 20 w (t) + <j> K (t)} + 0(t 2 , tm) (5) 

In this example, the leading term (tree graph of C^) is trivial, because ^+(0) 
represents the charge of the particle. At order p 2 , there are two contributions: the 
term linear in t arises from a tree graph of and involves the coupling constant 
L 9 , while the functions 4> w (t) and ^xif) originate in one loop graphs generated by 
The loop integrals contain a logarithmic divergence which is absorbed in a 
renormalization of L 9 - the net result for f v + (t) is independent of the regularization 
used. The representation (|5[) shows how the straightforward Taylor series ([!]) is 
modified by the singularites due to irn and KK intermediate states. At the order of 
the chiral expansion we are considering here, these singularities are described by the 
one loop integrals ^(t), (pxif) which contain cuts starting at t = 4M 2 and t = ^M^, 
respectively. The result (^]) also shows that chiral symmetry does not determine the 
pion charge radius: its magnitude depends on the value of the coupling constant 
L 9 - the effective Lagrangian is consistent with chiral symmetry for any value of 
the coupling constants. The symmetry, however, relates different observables. The 
slope of the K [3 form factor /+(£), e.g., is also fixed by L 9 . The experimental value 
of this slope 0, A + = 0.030, can therefore be used to first determine the magnitude 
of Lg and then to calculate the pion charge radius. This gives (r 2 ) 7T + = 0.42 fm 2 , to 
be compared with the experimental result, 0.44 fm 2 0. 

2 Scale of the loop expansion 

One of the main problems encountered in the effective Lagrangian approach is the 
occurrence of an entire fauna of effective coupling constants. If these constants are 
treated as totally arbitrary parameters, the predictive power of the method is equal 
to zero - as a bare minimum, an estimate of their order of magnitude is needed. 

Let me first drop the masses of the light quarks and send the heavy ones to 
infinity. In this limit, QCD is a theoretician's paradise: a theory without ad- 
justable dimensionless parameters. In particular, the effective coupling constants 
F, B, Li, L 2 , . . . are calculable — the available, admittedly crude evaluations of F 
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and B on the lattice demonstrate that the calculation is even feasible. As discussed 
above, the coupling constants L\, L 2 , . . . are renormalized by the logarithmic diver- 
gences occurring in the one loop graphs. This property sheds considerable light on 
the structure of the chiral expansion and provides a rough estimate for the order of 
magnitude of the effective coupling constants |7j . The point is that the contributions 
generated by the loop graphs are smaller than the leading (tree graph) contribution 
only for momenta in the range | p | ^A x , where || 

A x = 4nF/jN~ f (6) 

is the scale occurring in the coefficient of the logarithmic divergence (Nf is the 
number of light quark flavours). This indicates that the derivative expansion is 
an expansion in powers of (p/A x ) 2 with coefficients of order one. The stability 
argument also applies to the expansion in powers of m u , and m s , indicating that 
the relevant expansion parameter is given by (M n /A x ) 2 and (M^/A x ) 2 , respectively. 



3 Low lying excited states 

A more quantitative picture can be obtained along the following lines. Consider 
again the e.m. form factor of the pion and compare the chiral representation ([5]) 
with the dispersion relation 

dt' 



I r°° dt 

71 J AMI t —t 



In this relation, the contributions <ft n , <j)x from the one loop graphs of %PT correspond 
to 7T7T and KK intermediate states. To leading order in the chiral expansion, the 
corresponding imaginary parts are slowly rising functions of t. The most prominent 
contribution on the r.h.s., however, stems from the region of the p-resonance which 
nearly saturates the integral: the vector meson dominance formula, f-K+{t) — (1 — 
t/M 2 ) -1 , which results if all other contributions are dropped, provides a perfectly 
decent representation of the form factor for small values of t. In particular, this 
formula predicts (r 2 ) n + = 0.39 fm 2 , in satisfactory agreement with observation (0.44 
fm 2 ). This implies that the effective coupling constant L 9 is approximately given by 



19 ~ 2Mf • (7) 

In the channel under consideration, the pole due to p exchange thus represents the 
dominating low energy singularity - the tttt and KK cuts merely generate a small 
correction. More generally, the validity of the vector meson dominance formula 
shows that, for the e.m. form factor, the scale of the derivative expansion is set by 
M p = 770 MeV. 
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Analogous estimates can be given for all effective coupling constants at order p 4 , 
saturating suitable dispersion relations with contributions from resonances 0, [K|, 
e.g. 

L " = AMI ' Ll = ~48Mj, ' (8) 

where Ms — 980 MeV and M v > = 958 MeV are the masses of the scalar octet and 
pseudoscalar singlet, respectively. In all those cases where direct phenomenological 
information is available, these estimates do remarkably well. I conclude that the 
observed low energy structure is dominated by the poles and cuts generated by the 
lightest particles - hardly a surprise. 



4 Magnitude of the effective coupling constants 

The effective theory is constructed on the asymptotic states of QCD. In the sector 
with zero baryon number, charm, beauty, . . . , the Goldstone bosons form a com- 
plete set of such states, all other mesons being unstable against decay into these 
(strictly speaking, the f] occurs among the asymptotic states only for = m u ; it 
must be included among the degrees of freedom of the effective theory, nevertheless, 
because the masses of the light quarks are treated as a perturbation — in massless 
QCD, the poles generated by the exchange of this particle occur at p = 0). The 
Goldstone degrees of freedom are explicitly accounted for in the effective theory - 
they represent the dynamical variables. All other levels manifest themselves only 
indirectly, through the values of the effective coupling constants. In particular, low 
lying levels such as the p generate relatively small energy denominators, giving rise 
to relatively large contributions to some of these coupling constants. 

In some channels, the scale of the chiral expansion is set by M p , in others by 
the masses of the scalar or pseudoscalar resonances occurring around 1 GeV. This 
confirms the rough estimate @. The cuts generated by Goldstone pairs are signifi- 
cant in some cases and are negligible in others, depending on the numerical value of 
the relevant Clebsch-Gordan coefficient. If this coefficient turns out to be large, the 
coupling constant in question is sensitive to the renormalization scale used in the 
loop graphs. The corresponding pole dominance formula is then somewhat fuzzy, 
because the prediction depends on how the resonance is split from the continuum 
underneath it. 

The quantitative estimates of the effective couplings given above explain why it 
is justified to treat m s as a perturbation. At order p 4 , the symmetry breaking part of 
the effective Lagrangian is determined by the constants L 4 , . . . , L 8 . These constants 
are immune to the low energy singularities generated by spin 1 resonances, but are 
affected by the exchange of scalar or pseudoscalar particles. Their magnitude is 
therefore determined by the scale Ms — M v > ~ 1 GeV. Accordingly, the expansion 
in powers of m s is controlled by the parameter [M^j 'Ms) 2 — \. The asymmetry in 
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the decay constants, e.g., is given by |TIJ 

where the term "xlogs" stands for the chiral logarithms generated by intermediate 
states with two Goldstone bosons. This shows that the breaking of the chiral and 
eightfold way symmetries is controlled by the mass ratio of the Goldstone bosons to 
the non-Goldstone states of spin zero. In x?T, the observation that the Goldstones 
are the lightest hadrons thus acquires quantitative significance. 



5 Light quark masses 



A crude estimate for the order of magnitude of the light quark masses was given 
shortly after the discovery of QCD |TT[ : 



m. 



4MeV 



rrid — 6 MeV 



m.. 



125 - 150 MeV 



Many papers dealing with the issue have appared since then. Weinberg [|12| pointed 
out that xPT leads to an improved estimate for the ratios m u : rrid '■ rn s . Using 
the Dashen theorem [13] to account for e.m. self energies, the lowest order mass 
formulae given in eqs. (§) and @ imply [|12|] 



m d 
m s 
md 



M 2 K+ 



M| + 2M2 - Ml 



M 2 K0 



-Ml + +Ml 



M 2 K0 



+ Mf 



K+ 



Ml 



M 2 R0 



+ M 2 



(10) 



Numerically, this gives 



— = 0.55 
rrid 



— = 20 

rrid 



The higher order terms in the chiral expansion generate corrections to the mass 
formulae (|10|), controlled by the parameter (Mx/A x ) 2 . Roughly, the numerical 
result flnp should therefore hold to within 20 or 30%. 

The corrections of 0(p 4 ) to the above mass formulae were worked out some time 
ago ||]. In particular, it was shown that these corrections drop out when taking the 
double ratio 

M 2 



2_M 2 K 



Q = 



Ml 



M 2 



M| 



Mh 



(12) 



The observed values of the meson masses thus provide a tight constraint on one 
particular ratio of quark masses, 



Q 



1 " 2 

2 m s — m 



mf 



{l + 0(m 2 )} 



(13) 
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with rh = \{m u + rrid). The constraint may be visualized by plotting the ratio 
Trig/ma versus m u jm d ||14j| . Dropping the corrections of 0(m 2 ) = 0(M^/A^), the 
resulting curve takes the form of an ellipse, 

W'+^(5)'=l, (14) 
\m d J Q z \m d J 

with Q as major semi-axis (the term m 2 /m 2 has been discarded, as it is numerically 
very small). The meson masses occurring in the double ratio flT2| ) refer to pure 
QCD. Using the Dashen theorem to correct for the e.m. self energies, one obtains 
Q = 24.1. For this value of the semi-axis, the ellipse passes through the point 
specified by Weinberg's mass ratios, eq. (PI). 

6 Corrections to the Dashen Theorem 

The Dashen theorem is subject to corrections from higher order terms in the chiral 
expansion. As usual, there are two categories of contributions: loop graphs of order 
e 2 m and terms of the same order from the derivative expansion of the effective e.m. 
Lagrangian. 

The Clebsch-Gordan coefficients occurring in the loop graphs are known to be 
large, indicating that two-particle intermediate states generate sizeable corrections; 
the corresponding chiral logarithms tend to increase the e.m. contribution to the 
kaon mass difference [|TjJ. The numerical result depends on the scale used when 
evaluating the logarithms. In fact, taken by themselves, chiral logs are unsafe at 
any scale. 

The magnitude of the contributions from the terms of order e 2 m occurring in the 
effective Lagrangian is estimated in two recent papers by Donoghue et al. [TO and 



Bijnens fPT |. Although the framework used is rather different, they come up with the 
same conclusion: the corrections are large, increasing the value (M K + — M K o) e . m . = 
1.3 MeV predicted by Dashen to 2.3 MeV (Donoghue et al.) and 2.6 MeV (Bijnens), 
respectively. 

In the present context, the main point is that even large corrections of this size 
only lead to a small change in the value of Q, because the mass difference between 
K + and K° is predominantly due to m d > m u : the value Q = 24.1 (Dashen) 
is lowered to Q = 22.1 (Donoghue et al.) and Q = 21.6 (Bijnens), respectively. 
Expressed in terms of the error bars attached to the value of Q in [HJ, a doubling 
of the electromagnetic self-energy modifies the value of Q by l|er. 

The decay rj — > 37r provides an independent measurement of Q: writing the decay 
rate in the form T^^+^-^o = T /Q A , %PT predicts the value of T in a parameter 
free manner ||. Although the calculation accounts for all corrections of 0(p 4 ), the 
numerical accuracy is rather modest because, due to strong final state interaction 
effects, the calculated corrections are quite large. Since the quantity Q enters in the 
fourth power, the value T^^+^-^o = 283 ± 28 eV given by the particle data group 
|| still yields a decent measurement: Q = 20.6 ± 1.7. The fact that this result is 
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significantly smaller than the value predicted with the Dashen theorem represents 
an old puzzle. The problem disappears if the e.m. contribution to the kaon mass 

120]. 



18, 19 



difference is significantly larger than indicated by the Dashen theorem 
In particular, the values of Q obtained in fl6| , [17] are consistent with the one from 
rj decay. 

The theoretical uncertainties in the rj decay amplitude could be reduced. The 
calculation referred to above only accounts for the corrections of order p 4 and in- 
cludes final state interaction effects and r}T]' mixing only to that order. A dispersive 
analysis along the lines indicated by Khuri and Treiman plf , which uses the 
predictions only for the subtraction constants would likely lead to a more accurate 



estimate of the major semi-axis [22 



7 The ratio m M /m^ 



Chiral perturbation theory thus fixes one of the two quark mass ratios in terms of 
the other, to within small uncertainties. The ratios themselves, i.e., the position 



on the ellipse, are a more subtle issue. Kaplan and Manohar [14] pointed out that 
the corrections to the lowest order result ([11]) for m M /m^ cannot be determined 
on purely phenomenological grounds. They argued that these corrections might 
be large and that the w-quark might actually be massless. This possibility is of 
particular interest, because the strong CP problem would then disappear. Several 
authors [^3| have given arguments in favour of m u = 0. 



Let me show the picture this reasoning leads to. The lowest order mass formulae 
©) (H) imply that the ratio m u /md determines the K°/K + mass difference, the 
scale being set by M T : 

M 2 K0 - M 2 K+ = md ~ mu . Ml + . . . (15) 
m u + m d 

The formula holds up to corrections from higher order terms in the chiral expansion 



and up to e.m. contributions. It is illustrated in figure 1, taken from ||24|| . The upper 
curve corresponds to the value (M&-+ — M^o) e m . = 1.3 MeV, which follows from the 
Dashen theorem. The correction of Donoghue et al. [|16|] shifts the result by 1 MeV 
(lower curve). The horizontal line is the experimental value. Hence the corrections 
from the higher order terms must generate the contributions shown by the arrows. 
In particular, if m u is assumed to vanish, the lowest order mass formula predicts a 
mass difference which exceeds the observed value by a factor of four. The disaster 
can only be blamed on the "corrections" from the higher order terms. It is evident 
that, under such circumstances, it does not make sense to truncate the expansion at 
first nonleading order and to fool around with the numerics of the effective coupling 
constants occurring therein. The conclusion to draw from the assumption m u = 
is that ;\TT is unable to account for the masses of the Goldstone bosons. The fact 
that it happens to work remarkably well in other cases must then be accidental. I 
prefer to conclude that the assumption m u = is not tenable. 



8 



Figure 1: Sensitivity of the first order prediction for the kaon mass difference to 
^n u / m d- The two curves differ in the estimate used for the e.m. self energies. E is 
the experimental value. The dot corresponds to Weinberg's leading order result. 



8 Phenomenological ambiguities 

The preceding presentation is an extract of a rapporteur talk given in 1992 [[24]]. I 
thought that this coffee was rather cold by now and dealt with it only briefly in the 
manner described above, concentrating the remainder of the talk on issues which I 
expected to be more interesting to the audience, such as the role of winding number 
and the mass of the rf |26|. The vigorous discussion triggered by the claim that 
m u = is firmly ruled out by now and the remarkable statements made by some of 
the participants indicate, however, that not everyone fully shares this view of the 
matter. I conclude that the arguments given previously do not suffice and elaborate 
a little further. 

To set up notation, I first briefly review the observation of Kaplan and Manohar 



14| , who pointed out that the matrix 

m! = aim + a2ijn + )~ 1 det m 

transforms in the same manner as m. For a real, diagonal mass matrix, the trans- 
formation amounts to 

m' u = a.\m u + a 2 m rf m s (cycl. u — > d — > s — > u) . (16) 

Symmetry alone does therefore not distinguish w! from m. If £ e ff(U, dU, . . . ,m) is 
an effective Lagrangian consistent with chiral symmetry, so is £ e ff(U, dU, . . . , mf). 

Since only the product Bm enters the Lagrangian, a\ merely changes the value 
of the constant B. The term proportional to ai is a correction of order m 2 which, 
upon insertion in C 2 e ^ generates a contribution to The contribution can again 
be removed by changing some of the coupling constants: 

B' = B/ai , L' e = L 6 -a , L' 7 = L 7 - a , L' 8 = L 8 + 2a , (17) 
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with a = a2F 2 /32a\B. The effective Lagrangian is therefore invariant under a 
simultaneous change of the quark mass matrix and of the coupling constants. Ac- 
cordingly, the meson masses and scattering amplitudes which one calculates with 
this Lagrangian remain the same. The elliptic constraint considered above, e.g., is 
invariant under the operation (up to terms of order (m u — ma) 2 /™, 2 , which were 
neglected). The chiral representation for the Green functions of the vector and ax- 
ial currents are also invariant. Since there is experimental information only about 
masses, scattering amplitudes and matrix elements of the electromagnetic or weak 
currents and since the chiral representation for these does not distinguish m,B,Li 
from m' , B', phenomenology does not allow one to determine the magnitude of 
the constants B, L 6 , L 7 , L 8 . 

We are not dealing with a hidden symmetry of QCD here — this theory is not 
invariant under the change (|T6|) of the quark masses. In particular, the matrix 
elements of the scalar and pseudoscalar operators are modified. Consider, e.g., the 
vacuum-to-pion matrix element of the pseudoscalar density. The Ward identity for 
the axial current implies that this matrix element is given by 

(0| di l5 u\ir + ) = V2F n+ M 2 + /(m u + m d ) (18) 

The relation is exact, except for electroweak corrections. It involves the physical 
quark masses and is not invariant under the above transformation. For the ratio 
of the K and it matrix elements, the effective Lagrangian yields the following low 
energy representation: 

(01 si lb u\K + ) , A(M 2 + -M 2 +) , tT rx , ^. 2N 
\ n t. 4 = 1 + * P2 " ^8 - L 5 ) + X logs + O m 2 . 
(0| di^z,u\it + ) F z 

The relation is analogous to the formula for the ratio of the corresponding matrix 
elements of the axial currents, 

^ = i + 4(M| v -M; + ) L5+xlogs+0(m2) 

While the ratio of decay constants is invariant under the KM transformation, the 
one of the pseudoscalar densities is not. The main difference between the two cases 
is that nature is kind enough to provide us with a weak interaction probe, testing 
the matrix elements of the vector and axial currents at low energies, while a probe 
which would test those of the scalar and pseudoscalar currents is not available - 
the Higgs particle is too heavy for this purpose. The observed rates of the decays 
K — > fiu and n —>■ fiu imply F^+ / F 7T + = 1.22 and thereby permit a phenomenological 
determination of the effective coupling constant L5, while L$ cannot be determined 
on purely phenomenological grounds. 

If the electroweak interactions were not available to probe the low energy struc- 
ture of QCD, the coupling constant L 5 would also count amoung those quantities 



10 



for which direct phenomenological information is absent, for the following reason. 
The field 



U' = U{1 + a 3 {U^ m — m)U) + 0(m 2 )} (19) 

transforms in the same manner as U. For a constant quark mass matrix, the change 
of variables U — > Z7' is equivalent to a change of the effective coupling constants: 

L' 5 = L 5 - 6a , L' 7 = L 7 + a , L 8 — 3a , a = F 2 a 3 /2AB . (20) 

Hence the results for the masses of the Goldstone bosons and for their scattering 
amplitudes only involve those combinations of coupling constants which are invariant 
under this operation. The only difference to the transformation considered above 
is that (|I7D leaves the matrix elements of the vector and axial currents invariant, 



while (|20D does not. 

The ambiguity pointed out by Kaplan and Manohar does not indicate that the 
effective Lagrangian possesses any symmetries beyond those of the QCD Lagrangian. 
It does not concern QCD as such, but originates in the fact that the electromag- 
netic and weak interactions happen to probe the low energy structure of the system 
exclusively through vector and axial currents. 



9 Additive renormalization of m and all that 

One of the claims repeatedly made in the discussion is that the quark masses oc- 
curring in the effective Lagrangian must be distinguished from those entering the 
Lagrangian of QCD. Indeed, this problem does arise within early formulations of 
the effective Lagrangian technique, which exclusively dealt with the properties of 
the Goldstone bosons on the mass shell. There, the structure of the effective La- 
grangian was inferred from global symmetry arguments and the only place where the 
quark mass matrix entered was through the transformation law of the term which 
explicitly breaks the symmetries of the QCD Hamiltonian. 

The framework used in current work on xPT, however, does not rely on global 
symmetry arguments, but identifies the quark masses through their contributions 
to the Ward identities, which express the symmetries of the underlying theory in 
local form. The method was introduced ten years ago HI- It is not limited to 
on-shell matrix elements, but also specifies the low energy expansion of the Green 
functions formed with the vector, axial, scalar and pseudoscalar currents. The Ward 
identities involve the physical quark masses, not some effective low energy version 
thereof. For the effective theory to reproduce the low energy structure of the Green 
functions, the mass matrix occurring in the effective Lagrangian must be identified 
with the physical one. Likewise, the magnitude of the effective coupling constants 
Lq, L 7 and L 8 is not an inherently ambiguous issue; the effective theory reproduces 
the low energy expansion of QCD for precisely one value of these constants. 
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There is a distinction between the currents associated with chiral symmetry 
and the scalar or pseudoscalar densities in that the latter pick up multiplicative 
renormalization, contragredient to the renormalization of the quark mass matrix. It 
is crucial that the renormalization used is mass independent, such that the Green 
functions of the scalar and pseudoscalar operators are given by the response of the 
effective action to a local change in the quark mass matrix. In the ratio of matrix 
elements considered above, the renormalization scale then drops out — this ratio 
is a perfectly well-defined pure number, which may be estimated, e.g., by means of 
QCD sum rule techniques. In fact, the sum rule for the two-point functions of the 
pseudoscalar currents is one of the main sources of information available today for 
the absolute magnitude of the light quark masses. Once numerical evaluations on a 
lattice reach sufficiently small quark masses, this method will allow a more precise 
determination of low energy matrix elements involving the scalar and pseudoscalar 
currents. 

Some authors |27] have pointed out that instantons offer a physical interpreta- 
tion for the KM-ambiguity. In the field of an instanton, the Dirac operator develops 



discrete zero modes. As pointed out by t'Hooft pq| , these modes undergo an effec- 
tive interaction which indeed generates a self-energy contribution for the up-quark 
proportional to m^m s . The problem with this picture is that the spectrum of the 
Dirac operator on which it is based is fictitious. Banks and Casher |29] have shown 
that the spontaneous breakdown of chiral symmetry requires the small eigenvalues 
of the Dirac operator to be distributed uniformly: the level density p(A) approaches 
the value p(0) = |(0| qq 1 0) | / tt when A — > 0. For the model described in P7| , the 
spectrum instead contains an isolated zero mode, such that the level density develops 
a peak there, p(X) cx 5(A). The peak may generate large flavour symmetry breaking 
effects, but at the same time, it shows that the model is in conflict with spontaneous 
chiral symmetry breakdown. The calculations carried out in [|27| are based on the 
assumption that a framework which is unable to account for the leading term in the 
effective Lagrangian (the one related to the constant B <-> (0| qq |0)) can be trusted 
when analyzing the higher order terms, i.e., the corrections. 

There is some progress in understanding the level structure in a picture which 
describes the vacuum as a dense collection of instantons [26|. In that framework, 



the zero modes of the individual instantons merge into a continuum, such that the 
fictitious peak mentioned above disappears. For models with a decent spectrum, 
the symmetry breaking effects should be of reasonable size. 



10 The K° — K + mass difference 



-+ 



As pointed out in [24], the lowest order x?T formula for the K° — K + mass difference 
disagrees with observation by a factor of four — if m u is set equal to zero. Since I 
did not find this puzzle discussed in the literature p3| , I do it myself and first try 
to guess the reasoning put forward by the defence. 

Any lawyer worth a fraction of his fee would immediately point out that the 
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problem disappears if the quark masses occurring in the lowest order formula are 
replaced by suitable KM-transforms m' u , m' d ,m' s . In particular, one may take ai = 
1 and fix «2 in such a manner that the ratio of the primed masses agrees with 
the Weinberg ratios. This amounts to the statement that the chiral perturbation 
series should be reordered, replacing the expansion in powers of the physical quark 
masses by an expansion in the primed ones. With the above specification of these, 
the leading term of the reordered series agrees with the experimental value of the 
K° — K + mass difference, by construction. So, the first conclusion to draw from 
m u = is that the expansion of the meson masses in powers of the physical quark 
masses does not make sense — %PT must be replaced by a reordered series. Why 
does this not take care of the problem ? 

The point is that the operation very strongly distorts the matrix elements of the 
scalar operators, which are obtained from the effective action by expanding in powers 
of the physical mass matrix, not by some hand made variant thereof. Consider, e.g., 
the scalar form factors 

(K + \uu - dd\K + ) = S K +(t) , (n + \uu-ss\Tt + } = S n+ (t) . 

Since the operation s <->• d takes a K + into a 7r + , the two form factors coincide in the 
SU(3) limit. The difference Sx+(t) — S n +(t) is a symmetry breaking effect of order 
m s — m d . According to the Feynman-Hellman theorem [jSOfl , the values of these form 
factors at t — are related to the derivatives of the meson masses with respect to 
the quark masses. The kaon matrix element of the operator uu, e.g., represents the 
first derivative of Mj^+ with respect to m u , such that 

^ K+ (®} (<9m u drrid) K+ ' ^+(0) i^dm u dm^j n+ 

The expansion of the meson masses in powers of m u , m d , m s starts with [[| 

m k+ = { m u + m s )B{l + (m u + m s )K 3 + (m u + m d + m s )K A + . . . } 
Ml+ = (m u + m d )B{l + (m u + m d )K 3 + (m u + m d + m s )K^+ . . . } , 

where chiral logarithms and terms of 0(m 3 ) are dropped. Evaluating the derivatives, 
one readily establishes the low energy theorem 

r = = ( ms - mu ■ M ^~ M kA 2 d + (m*)\ (21) 

-MO) \m d -m u Mh-M^l \ +U{ } \ ' { ) 

valid up to chiral logarithms (in the present case, these are proportional to M% and 
therefore tiny, of order 1%). The theorem is of the same character as the one which 
leads to the elliptic constraint: the corrections are of second order in the quark 
masses. The relevant combination of quark masses which occurs here, however, fails 
to be invariant under the KM-transformation; the relation only holds if the masses 
entering the ratio (m s — m u )/(m d — m u ) are identified with the physical ones. 
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Suppose now that m u vanishes. The formula then predicts that the kaon matrix 
element is about three times smaller than the pion matrix element (the precise value 
of r depends on the electromagnetic corrections: using the value (M K o — M k +)qpd = 
5.3 MeV, which follows from the Dashen theorem, I obtain r = 0.30, while the values 



(M K o — M k +)qp-d = 6.3 MeV and 6.6 MeV found by Donoghue et al. ||T6[ and Bijnens 
Jl7j lead to r = 0.36 and r = 0.38, respectively). So, m u = leads to the prediction 
that the evaluation of the above matrix elements with sum rule or lattice techniques 
will reveal extraordinarily strong flavour symmetry breaking effects. 

Massless up-quarks usually come in a luxurious wrapping with plausible well- 
known statements of general nature. Indeed, there is no logical contradiction with 
the proposal that (i) the expansion of the meson masses in powers of the physical 
quark masses fails and (ii) a suitable reordering of the series leads to meaningful 
results. Also, the prediction that the matrix elements of the scalar and pseudoscalar 
operators exhibit dramatic flavour symmetry breaking effects is not in conflict with 
phenomenology. Although theoretical tools such as sum rules lead to the conclusion 
that the picture does not make sense, these are more fragile than solid experimental 
facts. It is conceivable that the present example represents an exception to Einstein's 
statement: "Ramniert ist der Herrgott, aber boshaft ist er nicht." Even then, the 
literature on massless up-quarks leaves much to be desired, as it does not address any 
of the consequences for the low energy structure of QCD, which are rather bizarre. 



11 Does the quark condensate vanish ? 

Let me finally turn to an entirely different picture [3T|, which I do not believe 
either, but find conceptually more interesting. The picture may be motivated by an 
analogy with spontaneous magnetization. There, spontaneous symmetry breakdown 
occurs in two quite different modes: ferromagnets and antiferromagnets. For the 
former, the magnetization develops a nonzero expectation value, while for the latter, 
this does not happen. In either case, the symmetry is spontaneosly broken (for a 
discussion of the phenomenon within the effective Lagrangian framework, see p^j). 
The example illustrates that operators which the symmetry allows to pick up an 
expectation value may, but need not do so. 

The standard low energy analysis assumes that the quark condensate is the lead- 
ing order parameter of the spontaneously broken symmetry, such that the Goldstone 
boson masses are proportional to the square root of the quark mass. The propor- 



tionality coefficient follows from the relation of Gell-Mann, Oakes and Renner 

FlMl = (m u + m d ) | (0\uu\0) | + ... 

The simplest version of the question addressed in the papers quoted above is whether 
the quark condensate indeed tends to a nonzero limit when the quark masses are 
turned off, like the magnetization of a ferromagnet, or whether it tends to zero, as 
it is the case for the magnetization of an antiferromagnet. If the second option were 
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realized in nature, the pion mass would not be determined by the quark condensate, 
but by the terms of order m 2 , which the Gell-Mann-Oakes-Renner formula neglects. 
More generally, one may envisage a situation, where the quark condensate is differ- 
ent from zero, but small, such that, at the physical value of the quark masses, the 
terms of order m and m 2 both yield a significant contribution. In the literature, this 
framework is referred to as "Generalized xPT" [31]. The "generalized" and "ordi- 
nary" chiral perturbation theories only differ in the manner in which the symmetry 
breaking effects are treated. 

The main problem with G%PT is that much of the predictive power of the stan- 
dard framework is then lost. The most prominent example is the Gell-Mann-Okubo 
formula for the octet of Goldstone bosons, which does not follow within G%PT. As 
mentioned above, this prediction of the standard framework is satisfied remarkably 
well, thus supporting that picture, but it is not unfair to say that this may be a 
coincidence, like the AJ = --rule, whose explanation within the Standard Model is 
also considerably more complex than the rule itself. The original motivation for the 
study of the generalized setting was the discrepancy between theory and experiment 
with regard to the 7rN cr-term, which indeed represented quite a fountain, contin- 
uously spitting out new ideas, one more strange than the other. In the meantime, 
this fountain has run dry, because the discrepancy was shown to be due to an inad- 
equate treatment of a form factor, which enters the relation between the scattering 
amplitude and the cr-term matrix element [[34j[. From my point of view, the gener- 
alized szenario is still of some interest, mainly as a kinematical framework, which 
parametrizes the vicinity of the xPT predictions and allows one to judge the signif- 
icance of the experimental information concerning these: the standard framework 
represents a special case of the generalized one. Although this is not very practical 
in general, one may even put the cart before the horse and claim that the predic- 
tions of xPT are based on extra assumptions, which the generalized setting does not 
make and which should be tested experimentally. Indeed, xPT leads to very strong 
predictions concerning, e.g., the scattering lengths of tttc scattering H35R , which are 
sensitive to the explicit breaking of chiral symmetry generated by the quark masses 
and which it is important to test experimentally. 



12 Summary and Conclusion 

It is difficult for me to summarize the present knowledge concerning the masses 
of the light quarks in an unbiased manner. The following conclusions unavoidably 
reflect my own views and mainly rely on work done together with Gasser. These 
views are dyed with the prejudice that a straightforward expansion of the matrix 
elements in powers of the light quark masses makes sense without reordering the 
series and that the Gell-Mann-Oakes-Renner relation is not spoiled by higher order 
effects. 

1. Concerning the absolute magnitude of the quark masses, the most reliable 
estimates are based on evaluations of QCD sum rules J36|. With the experience 
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Figure 2: The elliptic band shows the range permitted by the low energy theorem 
(14). The cross-hatched area is the intersection of this band with the sector se- 
lected by the phenomenology of ^'-mixing. The shaded wedge shows the constraint 
imposed on the ratio R by the mass splittings in the baryon octet. 



gained from the various applications of these, the evaluation leads to a remarkably 
stable result, the visible noise generated by uncertainties of the input being quite 
small [37]. The main uncertainty stems from the systematic error of the method, 
which it is hard to narrow down. The results do not indicate a significant change as 
compared to the estimates given in J38|. In the long run, lattice evaluations should 



allow a more accurate determination, but further progress with light dynamical 
fermions is required before the numbers obtained can be taken at face value. 

2. Chiral perturbation theory constrains the ratios m s /m^ and m u /md to the 
ellipse specified in equation ([L4]) and displayed in figure 2 (taken from fTO]). The 
magnitude of the semiaxis Q is known to an accuracy of 10%. The value Q = 24 
shown in the figure relies on the Dashen Theorem. The estimates given in [TR 



17|j indicate that this theorem receives large corrections from higher order terms, 
reducing the value of Q by about 10%. 

3. The position on the ellipse is best discussed in terms of the ratio 



„ m s — m 
R = — 



m d - m u 



which represents the relative magnitude of SU(3) breaking compared to isospin 
breaking. If m u were to vanish, this ratio would be related to the semiaxis by 
R ~ Q ~ 24 and the ratio of the strange quark mass to rh = \{m u + m^) would 
be given by m s /m ~ 2Q ~ 48. The quark mass estimates obtained from QCD sum 
rules and from numerical evaluations on a lattice are in flat disagreement with this 
pattern, but confirm the standard picture, where m s /m ~ Q ~ 24. 

4. The value of R was investigated much before QCD had been discovered. For 
a long time, it was taken for granted that isospin breaking is an electromagnetic 



effect. The early literature on the subject is reviewed in |3q| . In 1981, Gasser |39 
analyzed the nonanalytic terms occurring in the chiral expansion of the octets of 
meson and baryon masses and demonstrated that these remove the discrepancies 
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which had affected previous determinations of the ratio R. A thorough discussion 
of the available information concerning this ratio [38| showed that the data on E + — 
£~, S° — H _ , K° — K + and pc<>mixing allow four independent determinations of 
R (in the case of the proton-neutron mass difference, the higher order corrections 
turn out to be large, so that this source of information does not significantly affect 
the overall result). Within the noise visible in the calculation, estimated at 10-15%, 
the results turned out to be mutually consistent. Adding the uncertainties of the 
individual values quadratically, the analysis implies R = 43.5 ± 2.2. Discarding the 
information from K° — K + and from pcu-mixing, one instead obtains R = 43.5 ± 3.2, 
which corresponds to the range depicted as a shaded wedge in figure 2. 

5. The above error bars rely on the prejudice that the matrix elements of the 
operators uu, dd, ss do not exhibit strong SU(3) breaking effects. These were es- 
timated on the basis of the same rough argument, which identifies the quark mass 
difference m s — rh with the mass splitting within the SU(3) multiplets ppfl . When 
analyzing the meson mass spectrum to second order in the chiral expansion ||, we 
realized, of course, that the mass spectrum of the Goldstone bosons does not suffice 
to independently determine the ratios Q and R, but the algebraic nature of the 
ambiguity was pointed out only later, by Kaplan and Manohar [fPjl . In the opinion 
of some authors, this ambiguity appears to represent the most interesting aspect of 
the matter, indicating that the notion of quark mass is fuzzy at low energies, due to 
an additive renormalization problem, generated by nonperturbative effects, such as 
instantons, etc. The detailed discussion given above is another attempt at fighting 
these dangerous ogres, which the innocent observer might take for windmills (earlier 
attempts are described in [H], Q). Gradually, Rosinante is getting sick and 
tired. 

6. One of the consequences we did infer from the second order analysis is that the 
rjr]' mixing angle cannot be determined with the Gell-Mann-Okubo formula (9^ = 
— 10°), because there are other effects of the same algebraic order of magnitude, 
originating in L\^. Using the information about the value of R discussed above, we 
obtained 8 m > = —20° ±4°. The phenomenological analysis of the decays rj — > 77 and 
r]' — > 77, performed independently a year later P^f , neatly confirmed this prediction. 

7. As noted in ||, the result for the coupling constant L7, which follows from the 
above value of R, is consistent with the r( dominance formula This indicates 
that the sum rule used in the derivation of that formula is nearly saturated by the 
r)' intermediate state also in the real world, not only in the large N c limit, where the 
formula is exact. 

8. Turning the argument around, the phenomenology on the mixing angle avail- 
able today |4i| may be used to determine the quark mass ratios. This is indicated 
by the dashed straight lines in figure 2. Since the data agree with the prediction, 
it is clear that the information about the mass ratios obtained in this manner is 
consistent with the one extracted from isospin breaking in the baryon octet. Note, 
however, that the phenomenological analysis of the 77 and 7/ decays relies on large 
N c arguments. Although the resulting picture yields a coherent understanding of 
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quite a few processes [[P], £45j, the validity of these arguments needs to be examined 
more carefully to arrive at firm conclusions |[46|| . 

9. The branching ratio T{ip' — > i/m )/r(tp' —> iprj) provides further informa- 
tion about R. Using the lowest order formula, the data available at the time gave 

= 28 i| pSfl . The formula is valid up to SU(3) breaking effects; according to 
the general rule of thumb, these are expected to be of order 20 or 30%. Since 
we saw no way of evaluating these, the information derived from this branching 
ratio did not significantly affect our analysis and was discarded. Meanwhile, the 
data are slightly more precise ||; the lowest order formula now yields R = 31 ± 4. 
Furthermore, Donoghue and Wyler j|7| have investigated the second order correc- 
tions, using the multipole expansion, which relates the relevant matrix elements to 
(0| G^G^ln ), (0| G^G^lrj). As it is the case with the sum rule used to estimate 
L 7 , these matrix elements involve pseudoscalar operators and do thus not suffer from 
the KM-ambiguity. The calculation yields remarkably small SU(3)-breaking effects, 
such that the value obtained for R remains close to 31 ± 4. 

The problems faced by this attempt at estimating R are listed in EM. In partic- 



ular, as discussed in detail in |^8|], it is not clear that the relevant matrix elements 
may be replaced by those of the operator G^ V G^ V '. The calculation is of interest, 
because it is independent of other determinations, but at the present level of theo- 
retical understanding, it is subject to considerable uncertainties. In view of these, 
it appears to me that the numerical result is perfectly consistent with the value 
R = 43.4, which follows from the Weinberg ratios. 

As evidenced by the example of rjr]' mixing and quite a few others, the estimates 
of the effective coupling constants which follow from the hypothesis that the low 
energy structure is dominated by the singularities due to the lowest lying levels lead 
to a rather predictive framework, which until now has passed all experimental tests. 
It would be of interest to estimate the symmetry breaking effects in the branching 
ratio T(tp' — > ifj7r )/T(ip' — > iprj) on the basis of this hypothesis and to see whether 
they indeed increase the value of R obtained with the lowest order formula. If not, 
there would be a problem with the simple picture I am advocating here. 

10. There are examples, where the higher order corrections turn out to be 
large, such as rj — > 3tt or scalar form factors. In all cases I know of, one can put 
the finger on the culprit responsible for the enhancement. Invariably, the problem 
arises, because the perturbations generated by the quark mass term are enhanced 
by small energy denominators. In particular, the enhancement due to strong final 
state interactions in the S-waves often generates sizeable corrections, which are 
perfectly well understood — itself predicts how large the S-wave phase shifts 
are near threshold and that they rapidly grow with energy. Once the origin of the 
phenomenon is understood, one may take it into account and arrive at a reliable low 
energy representation, even if the straightforward chiral expansion thereof contains 
relatively large contributions from higher order terms. 

11. In the case of the mass formulae, the low energy singularities do not generate 
a significant enhancement of the higher order contributions. Accordingly, the esti- 
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mates of the effective coupling constants mentioned above imply that the Weinberg 
ratios only receive small corrections. There is some evidence for a 10% decrease in 
Q, related to the e.m. self energies: The Dashen theorem is suspected to receive 
large corrections, because some of the low energy singularities do generate large 
symmetry breaking effects in that case. 

12. Although the above hypothesis is the most natural setting for an effective 
theory I can think of, it evidently involves assumptions which go beyond pure sym- 
metry and phenomenology. One may dismiss this hypothesis and the estimates for 
R obtained with it, requiring only that the mass term of the light quarks, which 
breaks the chiral symmetry of the QCD Hamiltonian, may be treated as a pertur- 
bation. With m u = 0, the lowest order formula for M^o — is off by a factor 
of four, in flat contradiction with this requirement. The disaster can be avoided 
only if the factor (m<2 — m u ) / (m^ + m u ) accounts for at least half of the discrepancy, 
(m d — m u ) /{m d + m u ) < |. This alone yields m u /m d > ~. 

13. Accordingly, if m u is assumed to be equal to zero, must be thrown over- 
board. The expansion in powers of the quark masses does then not make sense and 
the matrix elements of the scalar and pseudoscalar operators must exhibit extraordi- 
narily strong SU(3) breaking effects. These cannot be explained with the low energy 
singularities listed in the particle data tables. As far as I know, the instanton model 
of Choi et al. [27| is the only theoretical scenario proposed to substantiate the claim 



that QCD may give rise to strong flavour symmetry breakings of this type. There, 
the occurrence of discrete zero modes does indeed produce such effects. The same 
feature, however, is the reason why the model is inconsistent with spontaneously 
broken chiral symmetry. Dilute instanton coffee is undrinkable. Bold discussion re- 
marks may warm it up a little, but that merely intensifies the unacceptable flavour 
characteristics of the brewing. 

I am indebted to Hans Bijnens, Jiirg Gasser, Marina Nielsen and Daniel Wyler for 
valuable comments. 
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